BIOTECHNOLOGY ^ 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientiflc and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: Zj^fr^-^ 



Cate Processed byjSTIC: I IINlO^ 



THE ATTACHED PRIN^TOLT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE SEE BELOW FOR ADDRESS: 

httD://www.uspto.gov/web/offices/pac/checker/chkr4 lnote.htm 



Applicants submitting genetic sequciKe information electronically on diskette or CD-Rom should be aware thai there 

a possibihly that the disk/CD-Rom may have been affected by treatment given lo all incoming mail. 

Please consider using allcniatc methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the followinu to the indicated addresses: 

I EFS-Bio (<http://www.iispto.gov/ebc/efs/downloads/documents>htm> , EPS Submission 

User Manual - ePAVE) 
2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 
3 Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Windo\y, Lobby, Room IB03, Crystal Plaza Two, 

20 1 1 SouUi Clark Place, Arlington, VA 22202 
4. Federal Express, United Parcel Service, oi; other delivery service to: U S. Patent and Trademark Office, 

Box Sequence, Rocmi 4B03-Mailroom, CiysUil Plaza Two, 201 1 South Clark Pfecc, Arlington, VA 22202 



Revised 10/08/03 



Raw Sequence Listing Error Summary 



K RROR DETECTK D SUGGESTED CORRECTION SERIAL NuMiitK: f 't - ' ' 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA'' HEADERS, WHICH WERE INSERTED BY PTC SOFTWARE 

I Wrapped Nuclcics The numbfer/tcxt at the end of each hne "wrapped" 'down to the next line* *^\^ may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating if/ Please adjust your right margin to .3; this will 
prevent "wrapping." 



^Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

_Misaligned Amino The numbering under each 5*** amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 



Non-ASCII 



Variable Length 



_PatentIn 2.0 
"bug" 



The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Nomially, Patcntin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence; 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

(NEW RULES) <2I0> sequence id number 

<400> sequence id number 

000 



10 



II 



12 



Use of n's or Xaa*s 
(NEW RULES) 

Jnvalid <213> 
Response 

Use of<220> 



Patentin 2.0 
" "bug" 



Use of n's and/or Xaa*s have been detected in the Sequence Listing. , • 

Per L823 of Sequence Rules, use of <220>-*223> is MANDATORY if n*s or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which re^ue n or Xaa rcprcscnt^y. 

Per 1.823 of Sequence Rules, thtf only valid <213> responses are: Unknown, Artificial Sequence^ or 
scientific name (Genus/spccics). <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Sequence 

Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <2I3> "Organism" response- is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <J20> to <223> section. 

(Sec "Federal Register," OOrOl/1998, Vol. 63, No. 104, pp. 2963J02) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, • 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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RAW SEQUENCE LISTING DATF,: ll/]4/2003 

PATENT APPLICATION: US/10/699,035 TIME: 10:22:09 

lnp\it Set : A:\molecular marker bateman sequencelisting. t:xt 
Output Set: N:\CRF4\11142003\J699035.raw 



3 <110> APPLICANT: Bateman, John Francis 

4 t'itzgerald, David 

6 <120> TITLE OF INVENTION: A MoJ.ecular Marker 
B <130> FILE REFERENCE: A3605 6- PCT-USA-A 071838.0142 
C~ > 10 <140> CURRENT APPLICATION NUMBER: US/10/699 , 035 

11 <141> CURRENT FILING DATE: 2003-10-31 

13 <I50> PRIOR APPLICATION NUMBER: PCT/AU02/00542 



W — > 14r<150>) PRIOR APPLICATION NUMBER: 2002-05-02 
W~>n.6 <150> PRIOR APPLICATION NUMBER: AU PR4701/01 

^17 <151> PRIOR FILING DATE: 2001-05-02 
19 <160> NUMBER OF SEQ ID NOS : 40 

21 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

ERRORED SEQUENCES ^jj ' 

811 <2]0> SEQ ID NO: 20 OOOG V.-Ompiy 

812 <21]> LENGTH: 418 - • - 'V-- v^M'V-'<^^t*r- NeGd^"*'' 

813 <212> TYPE: PRT v.>'.m;...._ 

814 <2]3> ORGANISM: human ^ i L I ra L 

816 <400> SEQUENCE: 20 / //M^'^ 0< r\(^\rt^ /C^ijfi^ 

817 / (hM 
Met Leu Pro Trp Thr Ala Leu Gly Leu Ala Leu Sor Leu Arg Leu Ala 

819 Leu AJa Arg Ser G] y Ala G.I u Arg Gly Pro Pro Ala Ser Ala Pro Arg 
E~> 820 20 25 30 

822 Gly Asp Leu Met Phe Leu Leu Asp Ser Scr Ala Ser Val Ser His Tyr 
E~> 823 35 40 45 ^MJU^^>-^-^^ 

82b Glu Phe Ser Arg Val Arg GJ u Phe Val Gly Gin Leu Val AJa Pro Leu . 
E~> 826 50 55 60 ljy^.4jy\ X^cr^^ 

828 Pro Leu Gly Thr Gly Ala Leu Arg Ala Ser Lou Val His Val Gly Ser 0 

E— > 829 65 70 75 80 

831 Arg Pro Tyr Thr Glu Phe Pro Phe Gly G]n His Ser Ser GJ y Glu Ala 
E~> 832 85 90 95 

834 Aja Gin Asp Ala VaJ Arg Ala Ser Ala Gin Arg Met Gly Asp Thr His 
E~> 835 100 105 110 I n 1/ ^ ^ J 

837 Thr Gly Leu Ala Leu Val Tyr AJa Lys Glu Gin Leu Phe Ala Glu Ala ( JAM^ 

E~> 838 115 120 125 

840 Ser Gly Ala Arg Pro Gly Val. Pro Lys Val Leu Val Trp Val Thr Asp 
E~> 841 130 135 140 




^5 4- 



843 Gly Gly Ser Ser Asp Pro VaJ Gly Pro Pro Met Gin Glu Leu Lys Asp . ^ ' ^ jJi 

844 145 150 155 160 O^Th^ 

846 Leu Gly VaJ Thr Val Pho He Val Ser Thr Gly Arg Gly Asn Phe Leu KiZ/*x/d</^ 

847 165 170 175 fKAJyTHT^ ^ 
84 9 Glu Leu Ser Ala Ala AJa Ser Ala Pro Ala Glu Lys His Leu His Phe ^^^^{J^\^^xM^^ 



file://C:\CRF4\Outhold\VsrJ69Q01 5 htm 
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RAW SEQUENCE LISTING DATE: 11/14/2003 

PATENT APPLICATION: US/10/699,035 TIME: 10:22:09 

Input Set : A:\molGCular marker bateman sequencelisting.txt 
Output Sot: N:\CRF4\11142003\J699035.raw 



E- 


-> 


850 








180 










185 










190 










852 


Val 


Asp 


Val 


Asp 


Asp 


Leu 


His 


He 


He 


Val 


Gin 


Glu 


Leu 


Arg 


Gly 


Ser 


E- 


-> 


853 






195 










200 










205 












855 


He 


Leu 


Asp 


Ala 


Met 


Arg 


Pro 


Gin 


Gin 


Leu 


His 


Ala 


Thr 


Glu 


He 


Thr 


E- 


-> 


856 




210 










215 










220 














858 


Ser 


Ser 


Gly 


Phe 


Arg 


Leu 


Ala 


Trp 


Pro 


Pro 


Leu 


Leu 


Thr 


Ala 


Asp 


Ser 


E" 


-> 


859 


225 










230 










235 










240 






861 


Gly 


Tyr 


Tyr 


Val 


Leu 


Glu 


Leu 


Val 


Pro 


Scr 


Ala 


Gin 


Pro 


Gly 


Ala 


Ala 


E- 


-> 


862 










245 










250 










255 








864 


Arg 


Arg 


Gin 


Gin 


Leu 


Pro 


Gly Asn 


Ala 


Thr 


Asp 


Trp 


He 


Trp 


Ala 


Gly 


E- 


-> 


865 








260 










265 










270 










8 67 


Leu 


Asp 


Pro 


Asp 


Thr 


Asp 


Tyr 


Asp 


Val 


Ala 


Leu 


Val 


Pro 


Glu 


Ser 


Asn 


E- 


-> 


868 






275 










280 










285 












870 


Val 


Arg 


Leu 


Leu 


Arg 


Pro 


Gin 


He 


Lou 


Arg 


Val 


Arg 


Thr 


Arg 


Pro 


Glu 


E- 


-> 


871 




290 










295 










300 














873 


Glu 


Ala 


Gly 


Pro 


Glu 


Arg 


He 


Val 


He 


Ser 


His 


Ala 


Arg 


Pro 


Arg 


Ser 


E- 


-> 


874 


305 










310 










315 










320 






876 


Leu 


Arg 


Val 


Ser 


Trp 


Ala 


Pro 


Ala 


Leu 


Gly 


Ser 


Ala 


Ala 


Ala 


Leu 


Gly 


E- 


-> 


877 










325 










330 










335 








879 


Tyr 


His 


Val 


Gin 


Phe 


Gly 


Pro 


Leu 


Arg 


Gly 


Gly 


Glu 


Ala 


Gin 


Arg 


Val 


E- 


-> 


880 








340 










345 










350 










882 


Glu 


Val 


Pro 


Ala 


G.ly 


Arg 


Asn 


Cys 


Thr 


Thr 


Leu 


Gin 


Gly 


Leu 


Ala 


Pro 


E- 


-> 


883 






355 










360 










365 












885 


Gly 


Thr 


Ala 


Tyr 


Leu 


Val 


Thr 


Val 


Thr 


Ala 


Ala 


Phe 


Arg 


Ser 


Gly 


Arg 


E- 


-> 


886 




370 










375 










380 














888 


Glu 


Ser 


Ala 


Leu 


Ser 


Ala 


Lys 


Ala 


Cys 


Thr 


Pro 


Asp 


Gly 


Pro 


Arg 


Pro 


E- 


-> 


889 


385 










390 










395 










400 






891 


Arg 


Pro 


Arg 


Pro 


Val 


Pro 


Arg 


Ala 


Pro 


Thr 


Pro 


Gly 


Thr 


Ala 


Ser 


Arg 


E- 


-> 


892 










405 










410 










415 




E- 


-> 


894 


Glu 


Pro 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 11/14/2003 

PATENT APPLICATION: US/lO/699 , 035 TIME: 10:22:10 

Input Set : A:\molecular marker bateman sequencelisting.txt 
Output Set: N:\CRF4\11142003\J699035.raw 

Invalid Line Lengthy 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:20; Line(s) 817 

Use of_<220> Feature_(NEW RULES) ^ 

Sequence (s) ^are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" 
or"Unknovm" . Please explain source of genetic material in <220> to <223> 
section (See "Federal Register," 6/01/98, Vol, 63, No. 104 , pp. 29631-32) 
(Sec. 1.823 of new Rules) 

SGq#: 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 4 0 




file://C:\CRF4\Outhold\VsrJ699035 htm 



/o I Glials 



<210> 22 
<211> 182 
<212> PRT 



<213> ^artificial sequence 
<400> 22 




<210> 9 

<211> 20 

<212> DN. 
<213>, 



<400> 9 

ctcaaagcca tgcgtagtcc 



20 



I!. 



IIk- ■ 



...... -.1,.. /r.-v^t<;> tiiront'::r>ii? 



si'.iiii.ir (?rrors 



VERIFICATION SUMMARY DATE: 11/14/2003 

PATENT APPLICATION: US/10/699 , 035 TIME: 10:22:10 



Input Set : A:\molecular marker bateman sequencelisting.txt 
Output Set: N:\CRF4\11142003\J699035.raw 



L:10 M:270 C: Current Application Number differs, Replaced Current Application Number 
L:14 M:289 W: Identifier Missing or Out-Of-Order , <150> PRIOR APP FILING DATE 
L:16 M:289 W: Identifier Missing or Out -Of -Order , <150> PRIOR APP FILING DATE 
L:820 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ 10:20 
M:332 Repeated in SeqNo-20 

L:894 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 4 18 Found:402 SEQ:20 
L:986 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:22, <213> 
ORGANISM: artificial sequence 

L:986 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:22, <213> 
ORGANISM: artificial sequence 

L:986 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 22, Linet : 986 
L:1028 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:23, <213> 
ORGANISM: artificial sequence 
L:1028 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1028 M:258 W: Mandatory Feature missing 
L:1070 M:258 W: Mandatory Feature missing 
ORGANISM:artif icial sequence 
L;1070 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1070 M:258 W: Mandatory Feature missing 
L:1112 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1112 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1112 M:258 W: Mandatory Feature missing 
L:1154 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1154 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1154 M:258 W: Mandatory Feature missing 
L:1196 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1196 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1196 M:258 W: Mandatory Feature missing 
L:1238 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1238 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1238 M:258 W: Mandatory Feature missing 
L:1280 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1280 M:258 W: Mandatory Feature missing 
ORGANISM: art i f icial sequence 
L:1280 M:258 W: Mandatory Feature missing 
L:1322 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 
L:1322 M:258 W: Mandatory Feature missing 
ORGANISM:artif icial sequence 
L:1322 M:258 W: Mandatory Feature missing 
L:1367 M:258 W: Mandatory Feature missing 
ORGANISM: artificial sequence 



<223> Tag not found for SEQ#:23, <213> 

<223> Blank for SEQ# : 23, Line# : 1028 
<220> Tag not found for SEQ#:24, <213> 

<223> Tag not found for SEQ#:24, <213> 

<223> Blank for SEQ# : 24 , Line# : 1 070 
<220> Tag not found for SEQ#:25, <213> 

<223> Tag not found for SEQ#:25, <213> 

<223> Blank for SEQ# : 25 , Line# : 11 12 
<220> Tag not found for SEQ#:26, <213> 

<223> Tag not found for SEQ#:26, <213> 

<223> Blank for SEQ# : 2 6, Li net : 1 154 
<220> Tag not found for SEQ#:27, <213> 

<223> Tag not found for SEQ#:27, <213> 

<223> Blank for SEQ# : 27 , Linelt : 1 196 
<220> Tag not found for SEQ#:28, <213> 

<223> Tag not found for SEQ#:28, <213> 

<223> Blank for SEQ# : 28, Line# : 1238 
<220> Tag not found for SEQ#:29, <213> 

<223> Tag not found for SEQ#:29, <213> 

<223> Blank for SEQ* : 29, Line# : 1280 
<220> Tag not found for SEQ#:30, <213> 

<223> Tag not found for SEQ#:30, <213> 

<223> Blank for SEQ# : 30, Line# : 1322 
<220> Tag not found for SEQ#:31, <213> 
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L:1367 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1367 M:258 W: Mandatory Feature missing, 

L:1^09 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1409 M:258 W: Mandatory Feature missing, 

ORGANISM: arti ficial sequence 

L:1409 M:258 W; Mandatory Feature missing, 

L:1451 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1451 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1451 M:258 W: Mandatory Feature missing, 

L:1493 M:258 W; Mandatory Feature missing, 

ORGANISM: arti ficial sequence 

L:1493 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1493 M:258 W: Mandatory Feature missing, 

L:1517 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1517 M:258 W: Mandatory Feature missing, 

ORGANISM: artificial sequence 

L:1517 M:258 W: Mandatory Feature missing. 



<223> Tag not found for SEQ#: 31, <213> 

<223> Blank for SEQ# : 31 , Line# : 1367 
<220> Tag not found for SEQ#:32, <213> 

<223> Tag not found for SEQ#:32, <213> 

<223> Blank for SEQ# : 32, Line# : 1409 
<220> Tag not found for SEQ#:33, <213> 

<223> Tag not found for SEQ#:33, <213> 

<223> Blank for SEQ# : 33, Line# : 1451 
<220> Tag not found for SEQ#:34, <213> 

<223> Tag not found for SEQ#:34, <213> 

<223> Blank for SEQ# : 34, Line# : 1493 
<220> Tag not found for SEQ#:35, <213> 

<223> Tag not found for SEQ#:35, <213> 

<223> Blank for SEQ# : 35, Line# : 1517 
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VERIFICATION SUMMARY DATE: 11/14/2003 

PATENT APPLICATION: US/10/699,035 TIME: 10:22:10 



Input Set : A:\molecular marker bateman sequencelisting.txt 
Output Set: N:\CRF4\11142003\J699035.raw 

L:1541 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:36, <213> 
ORGANISM: artificial sequence 

L:1541 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:36, <213> 
ORGANISM: artificial sequence 

L:1541 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 36, Line# : 1 54 1 

L:1565 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:37, <213> 
ORGANISM: artificial sequence 

L:1565 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:37, <213> 
ORGANISM: artificial sequence 

L:1565 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 37 , Line# : 1565 

L:1589 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:38, <213> 
ORGANISM: artificial sequence 

L:1589 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:38, <213> 
ORGANISM: artificial sequence 
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